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METHOD AND APPARATUS FOR INTERNET CACHE 
CONTENT DELIVERY VIA A DATA DISTRIBUTION SYSTEM 



BACKGROUND 

The present invention relates generally to data distribution systems, and more 
particularly, to improved methods and apparatus for providing Internet cache content 
delivery using a data distribution system. 
5 The assignee of the present invention has developed a data distribution system, 

and in particular, a satellite-based data distribution system, that is used to distribute data 
supplied by content providers to personal computers of clients. In order to allow clients 
to have improved response times to obtain data distributed by the data distribution 
system, an important aspect of the system was developed relating to caching services 

10 that addresses this need. 

Several companies, including Inktomi, Cache Flow, and NetWork Appliance, 
for example, currently make cache appliances or specific software which are targeted at 
Internet service providers. SkyCache is currently operating caching services using 
satellite communications. 

15 It would be an improvement to have a method and apparatus that provides 

Internet cache content delivery using a data distribution system. 

SUMMARY OF THE INVENTION 

The present invention provides for a cache system and caching method that is 
20 used to improve data delivery service provided by a data distribution system, such as a 
satellite-based data distribution system. The present invention uses a master cache in 
which predictive harvesting of content is performed based upon probability 
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distributions of individual caches satisfying requests from their users (clients). Content 
in the master cache is distributed to user sites by way of the satellite-based data 
distribution system, and local caches at user sites are automatically loaded with 
incoming content. The use of the satellite-based data distribution system provides an 
5 efficient economic multiplier. 

More specifically, an exemplary system comprises a master cache that receives 
content for distribution by the data distribution system to one or more users. A 
gateway receives content distributed by the data distribution system from the master 
cache. One or more local caches store the content received by the gateway destined for 

10 the one or more users. Harvesting software processes information from the master 

cache and the gateway that corresponds to probability distributions that the local caches 
satisfy requests from their respective users to predictively distribute the desired content 
to the respective users. 

Content distributed by the data distribution system to the gateway is transferred 

1 5 into the local cache by creating a pseudo client on the gateway, receiving an interrupt at 
the pseudo client indicating that that content has arrived at the gateway, enabling the 
gateway as a sibling cache for the local cache, requesting content to be transferred from 
the sibling cache to the local cache, verifying that content has been transferred to the 
local cache during the transfer process, and disabling the gateway as a sibling cache of 

20 the local cache at the completion of the process. The local cache retrieved the content 
from the sibling cache until all content has been transferred. 

The master cache is built by processing statistics derived from the master cache 
and the local cache to produces a list of content to add to the master cache and a list 64 
of content to delete from the master cache. A pseudo client is formed that retrieves and 

25 verifies the content to be added to the master cache. The verified content is then 
transmitted from the master cache to the local cache. 

Caching is used to improve the response time of a user (client) that is served by 
the cache system employed in the data distribution system. Caching also benefits 
service providers by reducing communications costs between the service environment 

30 and the external world. Using satellite transmission services allows cost savings and 
response times to be further enhanced for a large number of service providers by 
allowing aggregation of their individual requests. 

The use of the satellite-based data distribution system provides for efficient 
distribution of cache content over a wide geographical area thereby allows many user 

35 sites to simultaneously update their local caches. This reduces cost of terrestrial 
infrastructure for the individual user sites. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The various features and advantages of the present invention may be more 
readily understood with reference to the following detailed description taken in 
conjunction with the accompanying drawings, wherein like reference numerals 
5 designate like structural elements, and in which: 

Fig. 1 illustrates the architecture of an exemplary caching system in accordance 
with the principles of the present invention used in conjunction with a satellite-based 
data distribution system; 

Fig. 2 illustrates exemplary apparatus and a method in accordance with the 
10 present invention for forcing content into a local cache; and 

Fig. 3 illustrates an exemplary method in accordance with the present invention 
for building the master cache. 

DETAILED DESCRIPTION 

15 Referring to the drawing figures, Fig. 1 illustrates the architecture of an 

exemplary caching system 10 in accordance with the principles of the present invention 
the invention is used in conjunction with a data distribution system 30, such as a 
satellite-based data distribution system 30 developed by the assignee of the present 
invention or an RF data distribution system, for example. An exemplary satellite-based 

20 data distribution system 30 comprises a network operations center 1 1 that includes 
subsystems and software that permits use of the data distribution system 30 by 
subscribers (users). This is illustrated as subscriber entitlement 12. 

The subscriber entitlement subsystem originates entitlement messages, which 
activates the entitled gateways to decode the data transmitted over the data distribution 

25 system 10. 

The satellite-based data distribution system 30 distributes content received from 
service providers by means of a satellite multicast feed 13 distributed by way of a 
satellite 14 to a gateway 15, referred to as a CyberStar Enterprise Gateway 15, and 
which is part of the present caching system 10. The content is typically transmitted in 

30 the form of http (hypertext transport protocol) objects such as web pages or nttp (News 
text transport protocol) objects such as news groups. 

Content received by the gateway 15 is transferred to a local cache 20 which is 
typically located on a user computer or server computer at a user site. For example, the 
content is stored in the local cache 20 on a server computer and is transferred to a user 

35 computer that is coupled to a user network 24 by way of a switch 22, for example. A 
router 23 is used to connect the user network 24 to the Internet 25. 
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The exemplary caching system 10 also includes a master cache 18 and harvest- 
ing software 17 that is typically disposed on a computer 19 at the network operations 
center 11. The computer 19 is coupled to the Internet 25 and receives content from 
content providers for distribution. The computer 19 is coupled by way of a backchan- 
5 nel 16 to the gateway 15. The backchannel 16 may be implemented in a variety of 
ways, including a dedicated narrowband circuit (telephone), a virtual private 
narrowband channel, or an internet connection, for example. The backchannel 16 is 
used to transmit hit/miss data and probability tables from the gateway 15 to the 
computer 19, and the harvesting software 17 processes information contained in the 

1 0 transmit hit/miss data and probability tables. 

Fig. 2 illustrates exemplary apparatus 40 and a method 50 in accordance with 
the present invention for forcing content into the local cache 20. The apparatus 40 
comprises the gateway 15 which is coupled to the local cache 20 as discussed with 
reference to Fig. 1. The method 50 is implemented in the gateway 15 and processes 

1 5 content received from the satellite 14, typically in the form of a multicast satellite feed. 
The method 50 comprises the following steps. 

Received content 51 is forced into the local cache 20 using the method 50 as 
follows. An entitlement message is received 52 by the gateway 15 from the subscriber 
entitlement subsystem that notifies 53 a pseudo client 41 that new objects have arrived. 

20 The pseudo client 41 is a computer program which runs on the gateway computer and 
uses the appropriate communication protocol to communicate with the local cache. An 
example of this protocaol is ICP (Internet Cache Protocol). However, the protocol may 
also be a proprietary one from a number of cache hardware vendors. The pseudo client 
41 also has processes that allow it to control or designate the presedence relationship 

25 beween the local cache and the gateway. 

The pseudo client 41 enables 54 the gateway 15 as a sibling cache 42 for the 
local cache 20. The standard Internet Protocol (IP), or Internet Cache Protocol (ICP), 
is used to communicate between the local cache 20 and the sibling cache 42. The 
pseudo client 41 then requests 55a the new objects 53 having arrived on the sibling 

30 cache 42. This request 55a makes use of standard control settings which force the local 
cache 20 to render any objects resident at the local cache as unable to satisfy the request 
55a. The local cache 20 then requests 55b the object from its sibling cache 42. The 
sibling cache 42 complies by sending 56 the object to the local cache 20. The local 
cache 20 in turn fowrards the object to the pseudo client 41 thus verifying 57 the object 

35 has been transferred to the local cache 20. The pseudo client 41 continues to request 
55a objects from the local cache 20 until all the new objects hve been transferred to the 
local cache 20. During the transfer process, the pseudo client 41 verifies 57 that all 
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pages have been transferred to the local cache 20. At the completion of the process, the 
pseudo client 41 disables 58 the gateway 15 as a sibling cache 42 of the local cache 20. 

Fig. 3 illustrates an exemplary method 60 in accordance with the present 
invention for building the master cache 18. The harvest analysis software 17 processes 
5 statistics 61 from the master cache 18 and all local caches 20. Parameters and statistics 
used by the harvest analysis software are, for example, time to live (TTL) the number 
of times an object was requested, the number of times an object was requested and 
satisfied by local cache 20 (Hit), the number of times an object was requested and 
satisfied by a server other than the local cache 20 (miss). Objects are added/dropped 
10 according to high mis counts and low hit counts. The TTL parameter can also be 

adjusted to reduce the object's miss count. The statistics are collected by the local cache 
20 as part of its normal operation. These statistics are periodically transferred to the 
gate way 15. 

The harvest analysis software 17 produces a list 62 of objects (corresponding to 
15 each item of content) to add to the master cache 18 and a list 64 of objects to delete from 
the master cache 18. A pseudo client 41a is used to retrieve 65 and verify 66 the objects 
to be added to the master cache. The master cache forwards 67 the content to the 
satellite NOC for transmission according to various distribution policies. The satellite 
NOC adds a level of conditional access onto the transmission stream to provide service 
20 to only those sites which have been entitled to the cache service. 

Thus, an improved method and apparatus for providing Internet cache content 
delivery using satellite transmission services have been disclosed. It is to be 
understood that the described embodiments are merely illustrative of some of the many 
specific embodiments that represent applications of the principles of the present 
25 invention. Clearly, numerous and other arrangements can be readily devised by those 
skilled in the art without departing from the scope of the invention. 
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What is claimed is: 

1. A caching system for use with a data distribution system, comprising: 

a master cache for receiving content for distribution by the data distribution 
system to one or more users; 

a gateway for receiving content that is distributed by the data distribution system 
from the master cache; 

one or more local caches for storing the content received by the gateway 
destined for the one or more users; and 

harvesting software coupled to the master cache and the gateway for processing 
information corresponding to probability distributions that the local caches satisfy 
requests from their respective users to predictively distribute the desired content to the 
respective users. 

2. The system recited in Claim 1 wherein the harvesting software processes 
information contained in transmit hit/miss data and probability tables generated at the 
gateway. 

3. The system recited in Claim 1 wherein the content comprises http objects. 

4. The system recited in Claim 1 wherein the content comprises nttp objects. 

5. The system recited in Claim 1 wherein the gateway comprises: 

a pseudo client for receiving an entitlement message indicating that that content 
has arrived at the gateway, for enabling the gateway as a sibling cache for the local 
cache, for requesting content to be transferred from the sibling cache to the local cache, 
for verifying that content has been transferred to the local cache during the transfer 
process, for disabling the gateway as a sibling cache of the local cache at the completion 
of the process; 

and wherein the local cache retrieves 56 the content from the sibling cache until 
all content has been transferred. 
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6. The method recited in Claim 5 wherein the Internet Protocol is used to 
communicate between the local cache and the sibling cache. 

7. The method recited in Claim 5 wherein the Internet Cache Protocol is used to 
communicate between the local cache and the sibling cache. 

8. The method recited in Claim 1 wherein the harvest analysis software: 
processes statistics derived from the master cache and the local caches to 

produces a list of content to add to the master cache and a list of content to delete from 
the master cache; 

5 forming a pseudo client to retrieve and verify 66 the content to be added to the 

master cache; and 

transmitting the verified content from the master cache to the local caches. 

9. A method for transferring content distributed by a data distribution system to 
a gateway into a local cache, comprising the steps of: 

creating a pseudo client on the gateway; 

receiving an interrupt at the pseudo client indicating that that content has arrived 
5 at the gateway; 

enabling the gateway as a sibling cache for the local cache; 
requesting content to be transferred from the sibling cache to the local cache; 
verifying that content has been transferred to the local cache during the transfer 
process; 

10 disabling the gateway as a sibling cache of the local cache at the completion of 

the process; 

causing the local cache to retrieve the content from the sibling cache until all 
content has been transferred. 

10. A method for building a master cache used to transfer content by way of a 
data distribution system to a local cache, comprising the steps of: 

processing statistics derived from the master cache and the local cache to 
produces a list of content to add to the master cache and a list of content to delete from 
5 the master cache; 

forming a pseudo client to retrieve and verify the content to be added to the 
master cache; and 

transmitting the verified content from the master cache to the local cache. 
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1 1 . The method recited in Claim 10 wherein the step of processing statistics 
comprises the step of processing information corresponding to probability distributions 
that the local caches satisfy requests from their respective users to predictively distribute 
the desired content to the respective users. 

12. The method recited in Claim 10 wherein the step of processing statistics 
comprises the step of processing information contained in transmit hit/miss data and 
probability tables. 



METHOD AND APPARATUS FOR INTERNET CACHE 
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ABSTRACT 

A cache system and caching methods that are used to improve data delivery 
service provided by a data distribution system. The present invention uses a master 
cache in which predictive harvesting of content is performed based upon probability 
distributions of individual caches satisfying requests from users. Content in the master 
cache is distributed to user sites by way of the data distribution system, and local 
caches at user sites are automatically loaded with incoming content. 
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